IBM DataStages Transformers
Passo a passo de como iniciar sua jornada na Dadosfera
DataStage Transformers
This section documents all deterministic transformers for IBM DataStage stages.
Available Transformers
- CopyStageTransformer - Dataset and stage reads with column filtering
- LookupStageTransformer - Multi-table JOINs with view mappings
- ModifyStageTransformer - Column renames, keep, and drop operations
- JoinStageTransformer - SQL JOINs (LEFT, INNER, RIGHT, FULL)
- AggregatorStageTransformer - GROUP BY with aggregations
- RemoveDuplicatesTransformer - DISTINCT and ROW_NUMBER() deduplication
- FunnelStageTransformer - UNION ALL for multiple inputs
- InputStageTransformer - Oracle SQL conversion and table reads
- ImportStageTransformer - CSV/file reads mapped to tables
- OutputOracleTransformer - SELECT passthrough for output stages
- TransformerStageTransformer - Column derivations (hybrid with LLM)
Stage Type Mapping
| DataStage Stage Type | Transformer | SQL Output |
|---|---|---|
| COPY | CopyStageTransformer | SELECT with column filtering |
| LOOKUP | LookupStageTransformer | SELECT with multiple JOINs |
| MODIFY | ModifyStageTransformer | SELECT with renames/filters |
| JOIN | JoinStageTransformer | SELECT with JOIN |
| AGGREGATOR | AggregatorStageTransformer | SELECT with GROUP BY |
| REMOVE_DUPLICATES | RemoveDuplicatesTransformer | SELECT with DISTINCT or ROW_NUMBER() |
| FUNNEL | FunnelStageTransformer | UNION ALL |
| INPUT | InputStageTransformer | Oracle SQL → Snowflake SQL |
| IMPORT | ImportStageTransformer | SELECT from mapped table |
| OUTPUT_ORACLE | OutputOracleTransformer | SELECT passthrough |
| TRANSFORMER | TransformerStageTransformer | SELECT with derivations |
Common Features
All DataStage transformers:
- Parse
.dsxorchestrate code - Extract column definitions from
modifysections - Support parameter substitution (
[&"param"],#param#) - Generate Snowflake-compatible SQL
- Build output schemas from stage metadata
Documentation Format
Each transformer is documented with:
- Overview - What the transformer does
- Capabilities - Features and supported operations
- DataStage Stage Example - Real
.dsxformat input - Generated SQL Output - Actual SQL produced
- Output Schema - Resulting column definitions
- Limitations - Known issues and unsupported features
Next Steps
Start with CopyStageTransformer to see the documentation format with real examples.
Updated 4 days ago
